
# f16 embedding optimization

**Qwen3 8B GGUF** (MIT, by ZeroWw)
A quantized text generation model that keeps the output and embedding tensors in f16 format while quantizing the remaining tensors to q5_k or q6_k, yielding a smaller file with performance comparable to the pure f16 model (as sketched below).
Large Language Model · English
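
All three models on this page follow the same recipe: keep the token embedding and output tensors at f16 and quantize everything else to q5_k or q6_k. As a minimal sketch of how such a file could be produced from a pure f16 GGUF, assuming llama.cpp's `llama-quantize` binary with its `--token-embedding-type` and `--output-tensor-type` overrides (file names below are placeholders, not the actual release artifacts):

```python
import subprocess

# Sketch: build a mixed-precision GGUF in the style described above.
# Assumes llama.cpp's llama-quantize binary is on PATH and supports the
# --token-embedding-type / --output-tensor-type overrides; file names are
# placeholders, not the actual ZeroWw release artifacts.
def quantize_mixed(src_f16: str, dst: str, base_type: str = "q5_k") -> None:
    subprocess.run(
        [
            "llama-quantize",
            "--token-embedding-type", "f16",  # keep the embedding tensor at f16
            "--output-tensor-type", "f16",    # keep the output tensor at f16
            src_f16,                          # input: pure f16 GGUF
            dst,                              # output: mixed-precision GGUF
            base_type,                        # remaining tensors -> q5_k (or q6_k)
        ],
        check=True,
    )

if __name__ == "__main__":
    quantize_mixed("Qwen3-8B.f16.gguf", "Qwen3-8B.q5_k-f16.gguf", "q5_k")
```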
**Qwen3 4B GGUF** (MIT, by ZeroWw)
A quantized text generation model with output and embedding tensors in f16 format and the remaining tensors in q5_k or q6_k quantization, resulting in a smaller size with performance comparable to the pure f16 version.
Large Language Model · English
**Gemma 3 4b It Abliterated GGUF** (MIT, by ZeroWw)
A quantized model that achieves a smaller size while maintaining high performance through mixed-precision quantization.
Large Language Model · English
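
To check whether a downloaded file actually follows this layout, one can inspect the per-tensor quantization types. A sketch using the `gguf` Python package and its `GGUFReader` (the file name is a placeholder):

```python
from collections import Counter

from gguf import GGUFReader  # pip install gguf

# Sketch: summarize per-tensor quantization types in a GGUF file to check
# that embedding/output tensors are f16 while the rest are q5_k/q6_k.
# The file name is a placeholder, not an actual release artifact.
reader = GGUFReader("Qwen3-8B.q5_k-f16.gguf")

type_counts = Counter()
for tensor in reader.tensors:
    type_name = tensor.tensor_type.name  # e.g. "F16", "Q5_K", "Q6_K"
    type_counts[type_name] += 1
    if "embd" in tensor.name or "output" in tensor.name:
        print(f"{tensor.name}: {type_name}")

print("tensor type counts:", dict(type_counts))
```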